Baseline Estimation in Arabic Handwritten Text-Line - Evaluation on AHTID/MW Database

نویسندگان

  • Anis Mezghani
  • Slim Kanoun
  • Souhir Bouaziz
  • Maher Khemakhem
  • Haikal El Abed
چکیده

Baseline extraction is one of the most important phases for handwriting recognition. Due to the complexity of the Arabic scripts, baseline detection of Arabic handwritten text-lines is a difficult task compared to other languages. In this work, a method which combines some baseline extraction techniques used in literature was presented to provide a fine estimation of baseline in Arabic handwritten text-lines. For evaluation purpose, the AHTID/MW database was extended by a baseline ground truth annotation. The database is freely available for researchers worldwide which enable other researchers to test their baseline detection

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ICDAR2015 Writer Identification Competition using KHATT, AHTID/MW and IBHC Databases

Handwriting is considered to be one of the commonly used modality to identify persons in commercial, governmental and forensic applications. In order to record recent advances in the field of writer identification, we are proposing to organize the ICDAR2015 writer identification competition using KHATT, AHTID/MW and IBHC Databases. A first edition of the Arabic Writer Identification Competition...

متن کامل

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

Identification of Arabic/French Handwritten/Printed Words using GMM-Based System

The discrimination between languages is one of the first steps in the problem of automatic documents text recognition. In many documents, such as bank checks and application forms, printed and handwritten texts are mixed. In this paper, an automatic identification system of Arabic and French words in both handwritten and printed script based on Gaussian Mixture Models (GMMs) was presented. A fi...

متن کامل

Component-based Segmentation of Words from Handwritten Arabic Text

Efficient preprocessing is very essential for automatic recognition of handwritten documents. In this paper, techniques on segmenting words in handwritten Arabic text are presented. Firstly, connected components (ccs) are extracted, and distances among different components are analyzed. The statistical distribution of this distance is then obtained to determine an optimal threshold for words se...

متن کامل

A novel baseline estimation method for arabic handwritten text based on exploited components of voronoi diagrams

The goal of this paper is to present an efficient novel baseline estimation method for Arabic handwritten text based on the exploited components of Voronoi Diagrams (VD). The proposed based-VD method is constructed from three stages including: Preliminary stages, VD construction and baseline estimation process. The edges of the text are firstly extracted and then both inner and outer contour ar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013